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Abstract 

A time series that represents daily values of the WIG index (the main in- 
dex of Warsaw Stock Exchange) over last 5 years is examined. Non-Gaussian 
features of distributions of fluctuations, namely returns, over a time scale are 
considered. Some general properties like exponents of the long range corre- 
lation estimated by averaged volatility and detrended fluctuations analysis 
(DFA) as well as exponents describing a decay of tails of the cumulative dis- 
tributions are found. Closing, the Zipf analysis for the WIG index time series 
translated into three letter text is presented. 
PACS 05.40.Fb, 05.45. Tp, 89.90.+n 

I. INTRODUCTION 

Financial markets are complex dynamical systems with many interacting elements: in- 
dividual investors, mutual funds, brokerage firms, banks from one side and bonds, stocks, 
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futures, options from the other side. Interactions between elements lead to transactions me- 
diated by the stock exchange. The details of each transaction are recorded for analysis. The 
nature of interactions between different elements comprising a financial market is unknown 
as well as the way in which external factors affect the market. Therefore, as a starting point 
one may turn to empirical studies to inquire into regularities called "empirical laws" that 
may drive financial markets. 

In the following we present results of investigations of the WIG index - the index of 
Warsaw Stock Exchange. (WIG is an abbreviation of the Polish name.) The WIG index is 
calculated as a total return of the weighted sum of market capitalization (a portfolio of the 
index) of all stocks from the main market once per trading day after each session. Every 
three months the portfolio is revised. New stocks that entered the market in previous three 
months are added and the portfolio weights are corrected to preserve two rules: any market 
capitalization of a stock takes more than 10% and participation of one industry stocks do 
not exceed 30% of the whole market capitalization. The Warsaw Stock Exchange is a young 
market — the first session took place on the 16th April of 1991 and five stocks were traded. 
Now, i.e. on 1 August 2000 the portfolio of the WIG index enumerates 120 stocks of the 
main market and 61 stocks are traded on the parallel market. 

In this presentation we study a time series of returns of the WIG index for the period of 
5 last years: from September 1995 to August 2000. Thus we observe the emerging market 
in its second phase of development. Therefore we can assume that the market is stationary 
(see for emerging market studies). 

The presentation is organized as follows: 

In Section 2, we introduce a time series considered and give a definition for returns- one 
of the basic notions in a study of financial markets. 

In Section 3, we perform tests to estimate strength and character of long-range correlation 
of Warsaw market. We find an instance of the strongly antypersistant random walk occurring 
in the time horizon longer than three months. 

In Section 4, we discuss properties of probability density functions of returns. We show 
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that the central part of distribution of WIG index returns is well fitted by a Levy distribution. 
Some time scaling is therefore provided. The asymptotic behavior of the distribution of 
returns shows faster decay than predicted by a Levy distribution. Hence, our result confirms 
Mantegna and Stanley proposition 0,?] of a truncated Levy distribution as a model for the 
distribution of returns. The exponential truncation ensures the existence of a finite second 
moment what concludes, by limit theorems, that the asymptotic distribution of returns is a 
Gaussian distribution. 

The next section, Section 5, we translate the time series of WIG index values into a 
text The Zipf ||[5| analysis drives to the observation that non trivial correlation exist 

between successive daily fluctuations, so that some predictions are possible. 

II. DATA IN STUDY 

We analyze the values of the WIG index for the period of almost 5 years: from 1st 
September 1995 to 31st July 2000. The data have been recorded at each trading day. So that 
our basic data set consists of 1224 values. In Fig.l we present a time series corresponding to 
daily values of the WIG index. Together, we plot the WIG index detrended by the inflation 
rate. Because of high variety in time it is difficult to give an estimation of satisfactory 
quality for the total trend. Rough calculations for the series of the detrended WIG index 
suggest the trend is neither upwards nor downwards. The total trend over the last five years 
is horizontal. 

To test the nature of the stochastic process underlying the changes in the WIG index 
value we investigate the time series of returns on varying time scales At. Returns are the 
basic quantities that are widely studied in economic analysis, see f° r a literature 

collection. For a time series S(t) of prices or market index values, a time series of returns 
G(At, t) over a time scale At is obtained as the successive differences of the natural logarithm 
of price, || 

G(At,t) = lnS{t + At) -lnS(t) (1) 
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A normalized to the variance 1 series of returns g(At,t) is 

9 (A M ) = g(AM) -^ (At - t)>T (2, 

where v 2 (At) is the time averaged variance, 

v 2 {At) =< G 2 {At,t) > T - < G(At,t) > 2 T 

and < .. >t denotes an average over the entire length of a time series. A daily series of 
g(l, t) together with examples of time series when the data set is sampled at 5, 10 and 20 
trading days are shown in Fig. 2. 



III. LONG-TIME CORRELATION 

A commonly used and efficient test in detecting the presence of long-range correlation 
is based on investigation the standard deviation of returns as a function of different time 
horizon At. This function is called the time averaged volatility and is denoted v(At), M. 
(In economics, the volatility of a certain stock is a measure how much a price is likely to 
fluctuate at a given time. It can also be related to the amount of information arriving at 
any time. Such an instantaneous volatility can be calculated as the local average of the 
amplitude of the returns, [[|.) 

The volatility dependence on the time scale At allows to determine an exponent 5 of a 
power-law behavior 

v(At) ~ At 5 . (3) 

For a random walk 5 = 0.5. In mature markets the exponent 5 takes value larger than but 
close to 0.5 what indicates absence of correlation for the daily and monthly returns or a 
walk with a weak persistent character 0,0. 

Fig. 3a is the log-log plot of the volatility of the WIG index returns against time to extract 
the exponent 5. It shows that initially for two to five days, the exponent takes the value 
5 = 0.553 what implies a slightly persistent walk. Then, for a time period of few further 
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days, the exponent switches to 5 = 0.464 what can be recognized as slightly antypersistent 
(zigzag) walk. If longer intervals are considered such as one or two month time, the volatility 
grows with 5 = 0.557. (In brackets the Pearson r 2 correlation coefficient is shown to give 
the precision of the linear dependence.) Finally, on the time scale of three or more months, 
one observes 5 = 0.269(!). 

For verification the last value we have performed a similar analysis for a series of returns 
obtained from the data of other indices of the Warsaw Stock Exchange and some particular 
stocks. These results are shown in Figs 3b,c,d. The WIG-20 (twenty largest firms according 
to market capitalization) index exhibits the same features as the WIG index. The WIRR 
index (the index of the parallel market) and the prices of some particular stocks do not 
change the character of correlation on the three-month scale. Instead, we observe the ampli- 
fication of the non-random walk features. One can notice the presence of strong long-range 
correlation in the most of the time series. We think that the rapid change in the trend of 
fluctuations of the WIG and WIG20 indices is effected by corrections made in calculating 
procedure for these indices. Every three months the WIG and WIG20 indices formulas are 
changed. 

In Fig. 4 one can find a plot of the exponent a arising from the Detrended Fluctuation 
Analysis (DFA) method ||. Again, one can notice switching from the persistent random 
walk taking place in the first week time to antypersistant walk emerging in the second week. 
The growing errors of regression fits, measured by the Pearson r 2 correlation coefficient, do 
not allow us to present as for returns over the longer time horizon. 

IV. PROBABILITY DENSITY FUNCTION 

One of the key aspect in determining stochastic evolution of the value of a given financial 
asset is the shape of the probability density function (pdf ) of its returns. Fig. 5a shows on the 
semi-log plot the pdf of one-day returns G(At = 1). Searching for properties of experimental 
pdf of returns we fit the data to the Gaussian and Lorentzian distributions. 
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The rough analysis of Gaussianness of an empirical pdf can be done by considering the 
kurtosis - the fourth normalized cumulant. The kurtosis equals to for a Gaussian dis- 
tribution. When the kurtosis is positive — a leptokurtic distribution, the corresponding 
distribution density has a market peak around the mean and rather 'thick' tails. Conversely, 
when the kurtosis is negative, the distribution density has a flat top and very thin tails. 
Therefore, the kurtosis is considered as a measure of a distance between an empirical data 
distribution and a Gaussian curve. The leptokurtic character of a stock price returns as well 
as stock index returns has been reported by several authors since Mandelbrot observation 



Fig. 5b shows values of the kurtosis for returns over the time scale. The leptokurtic 
character of distributions of the WIG index returns is evident when the distribution of 
returns over a time scale shorter than two weeks (10 trading days) is considered. Over 
a time scale of two, three months one should observe the change of the pdf shape to the 
opposite flat character. 

In Fig. 6a the probability density functions for returns, namely G(At), over different 
time scales At are shown. The typical spreading is observed, as if a random walk is exam- 
ined. Having disagreement between the empirical pdf and a Gaussian distribution though 
supposing stability of distributions, we search for a symmetrical Levy distribution, 



To find the most appropriate stable Levy pdf we calculate the probability of return to the 
origin, i.e., Prob{G(At) = 0}, as it is usually proposed ||||. Plotting Prob{G(At) = 0} 
against different time scales At on the log- log plot, we can observe two time regimes with 
slightly different power-law behavior, see Fig. 6b. The returns of few days time scale lead 
to the exponent value 0.676 while the returns of few weeks time scale provide 0.578 for the 
exponent. The index fi of a Levy distribution is the inverse of the exponent and thereby 
we get todays = 1-470 and fj, wee ks = 1-730. Both values are well inside the Levy stable range 
< /i < 2. It is known that the value of [i for stock markets falls between 1.5 and 1.7 and 



in 60-ties, 0. 




(4) 
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is lower for more volatile markets |TT|,|3[| and on the time scales longer than few days one 



should expect a convergence to Gaussian behavior ||. Hence, according to our results the 
Warsaw stock market is highly volatile and this volatility slows down the convergence to 
Gaussian behavior. 

The stable distributions are self-similar — the distributions of returns for various choices 
of At have the similar functional form. Having value of /i, we rescale our empirical pdfs 
to observe data collapse, see Fig.6c. Applying suitable scaling to different pdfs we find two 
separated areas with collapsing data. One area corresponds to returns with few days time 
scale and the other to returns of few weeks time horizon. 

Searching for the breakdown of a Levy description in empirical pdfs, rare events are 
usually considered. The probability of rare events are estimated by studying properties of 
the cumulative distribution of the normalized returns g(t) 0|§. The cumulative probability 
distribution F(g) of observing a change g or larger in a series of returns was found to be 
power-law for large values of g, both for positive and negative values of g with exponent fi, 

F(g) = Prob{g > x} ~ <T" (5) 

what implies the following asymptotic behavior of pdfs, 

P ( x ) ~ i \ x \ l ar g e enough 



x 



The value of /i changes when one moves the interval of g. Our estimates are, Figs 7ab: 

0.76 positive tail 



for g € (0.3, 0.9) : fi 



for g 6 (0.9, 1.6): n 



0.69 negative tail 

2.03 positive tail 
1.83 negative tail 



for g > 1.6 : \i 



3.88 positive tail 
3.06 negative tail 

The presented results are consistent with the apparent power-law behavior in tails found 
for daily returns for stock market indices such as NIKKEI: \x ~ 3.05, Hang-Seng : \x rs 3.03 



or S&P500 : fi ~ 3.34, [j3,B|. Moreover, since fi > 2 when g > 1.6, hence it is outside the 
Levy stable regime, our investigations lead to the conclusion that the second moment of 
fluctuations of the WIG index is finite and one should expect the convergence to a Gaussian 
distribution because of the central limit theorem. One should notice that the value for fi 
found by study the asymptotic nature of the distribution is different from the value obtained 
when the center part of the distribution is considered. This effect is usually related to the 



slow decay of the volatility correlation p,13 



V. ZIPF PLOT OF WIG INDEX 

The original Zipf analysis || consists in counting the number of words of a certain type 
appearing in a text, calculating the frequency of occurrence / of each word in a given text, 
and sorting out the words according to their frequency, i.e., a rank R is assigned to each 
word, namely, R = 1 for the most frequent one. A power law 

/ ~ R' C (6) 

with an exponent ( is searched for on a log- log plot. 

The similar analysis can be done to any series of data when one translates the data into 
a text ||]. In the following we translated the WIG index time series into a text based on 
two alphabets. The first text is obtained by transforming data into binary sequences by 
replacing up and down daily fluctuations by a character u and d, respectively. Another 
transformation: u for large up d for large down fluctuations and s for small fluctuations of 
both kinds below a given threshold, leads to a three- letter text. 

In Fig. 8 we show the Zipf plot for words consisting of n = 4, 5 letters of both texts 
obtained from the time series of WIG index values. The probability of a single letter to 
be found in the text examined is as follows: P(u) = 0.533 and P(d) = 0.467 when the 
two-letter alphabet is considered and if we deal with three letter alphabet then depending 
on the value of the threshold s we obtain: 



S 



s= 2% 1% 0.6% 0.2% 0.1% 

d 0.103 0.234 0.316 0.406 0.441 

s 0.774 0.510 0.318 0.119 0.059 

u 0.123 0.255 0.366 0.475 0.500 

The data plotted are the effective frequencies P/P', i.e., the apparent probability P of a 
word divided by its expected probability of occurrence P' if there is no correlation [[|. The 
Zipf plot using Pj P' is expected to be horizontal for random sequences. 

The value of ( changes depending on the way in which a time series is transferred into 
a text: 

Two-letter alphabet: 







4 - letter 


words 


0.15 


(r 2 = 0.; 


34) 






5 — letter 


words 


0.18 


(r 2 = 0.; 


33) 


Three- letter alphabet: 












4-letter words: 
















s 


2% 


1% 


0.6% 


0.2% 


0.1% 




c 


0.86 


0.35 


0.24 


0.30 


0.64 


( 


r 2 ) 


(0.94) 


(0.98) 


(0.91) 


(0.98) 


(0.97) 


5-letter words: 
















C 


1.16 


0.41 


0.26 


0.37 


0.79 


( 


r 2 ) 


(0.95) 


(0.95) 


(0.91) 


(0.90) 


(0.98) 



The lowest value is observed when probabilities of particular letters are approximately equal 
to each other and this happen when the bias s = 0.6%. The value of ( in this case is 
distinguishable different from zero, hence, non trivial correlation exist between successive 
daily fluctuations of the WIG index returns. Feeling tempted to make some predictions we 
search for no risk sequences of events, i.e., for the conditional probability equal to almost 
one that after a sequence of four events the fifth even occurs. We have found one of such an 



event: Prob(d\ssud) ~ 1 if s — 0.6% is considered. The set of such events strongly depends 
on the threshold value. Therefore we think that this analysis should be done for price of a 
stock instead of market index. 



VI. CONCLUSION 

It appears that some tools of statistical physics which were developed to study critical 
phenomena in complex systems can be succesfully applied in rather distant from physics 
disciplines like medicine, genetics, sociology, politics or economy. Many physicists have 
been attracted to interdisciplinary investigations collectively called exotic statistical physics 



14| . Additional excitement comes along when one turns its interest into econophysics since 
one can learn not only how the economic system works but also how to use this knowledge 
to make money |fT5| . The authors of this presentation seem to be typical victims of such 
prospects. The main purpose of the presented paper is to apply standard analysis of exper- 
imental econophysics to Warsaw Stock Exchange index to discover how to succeed on the 
Polish stock market. 

Our conclusion that Warsaw Stock Exchange is volatile is not surprising. The Polish 
stock market, as it happens in all countries that change their economic systems, is small 
one, therefore, it is in an extremely fragile state. The absence of a general positive trend in 
time series of the WIG index suggests that speculations are the only way to get an income or 
growth. However, we have to emphasize that our conclusion is only preliminary. The data 
set considered by us is rather limited; moreover, we do not explore and carry out all possible 
tests. We do not perform Hurst analysis of the data nor we study long-range dependences 
which emerge when the time dependent volatility is being investigated. The cross correlation 
between different stocks also has not been considered here. Hence there is a need for further 
investigations to verify our initial findings. 
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Figure captions 

Fig.l Daily values of the WIG index in time. We display both the value of the WIG index 
(black dots) and the index detrended by inflation to the August 2000 of Polish zloty 
(empty dots). 

Fig. 2 Sequences of (a) one-day, (b) 5-day (a week), (c) 10-day, (d) 20-day (a month) returns 
extracted from the WIG index data. 

Fig. 3 Log-log plot of the time averaged volatility as a function of the time scale At of returns 
of (a) WIG index, (b)WIG20 index, (c) WIRR index (d) selected stocks. Notice, the 
rapid change of the correlation trend after three months in case the WIG and WIG-20 
indices which is not observed when the WIRR index or particular stock prices are 
considered. 

Fig. 4 Multifractality analysis by DFA method. The value of the index a is calculated for a 
time series that was created by merging one after one, all picked up series generated 
from our data set. So that, we merge two time sequences of G(2,t), three time sequences 
of G(3,t), etc., to deal with a time series of the reasonable length for DFA studies. 
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Fig. 5 (a) Linear-log plot of the empirical probability density function for At = 1 re- 
turns G(l) of the WIG index. The best fit curve to a Gaussian distribution 
is 26.7 exp{— 0.5( a; ~ °° 11 ) 2 }. The best fit curve to a Lorenztian distribution is 
— J?o.mi2) . 2 ■ (b) Kurtosis and skewness of the returns of WIG index. A positive 
value of kurtosis indicates a slow asymptotic decay. The 'leptokurtic' character of 
the distribution of the data implies that using a Gaussian one systematically under- 
estimates the probability of large fluctuations. The negative skewness says that a 
distribution is asymmetric around its mean and the negative tail of a distribution is 
larger than the positive one. 

Fig. 6 (a) Probability density function of returns G(At) measured at different time horizons: 
At = 1,2,3 and 10, 15, 21 days, (b) Probability of return to the origin measured as a 
function of the time interval At. A Levy distribution has a slope > 0.5, a Gaussian 
distribution has a slope = 0.5. (c) Probability density functions of the returns plotted 
in rescaled units. Pdfs of short time scale returns are rescaled with fidays and pdfs of 
returns of weeks time scale with fi wee ks- 

Fig. 7 (a) Log-log plot of the cumulative distribution of normalized WIG returns for At = 1 
day (negative tails), (b) Log-log plot of the cumulative distribution of normalized 
WIG returns for At = 1 day (positive tails). 

Fig. 8 Zipf plots for respectively the u, d and s, u, d sequences obtained from the series of 
WIG returns of Fig.l. The length of analyzed pattern is n = 4, 5. 
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WIG index in time 
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[d) 20 day (a month) returns 
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; Volatility of WIRR index returns 
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Volatility of some stocks returns 
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Zipf plot (log-log) 

for WIG series of Fig. 1 
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